9                    Software Design Issues _f_o_r _t_h_e
9              _P_S-_1_8_6 _A_d_v_a_n_c_e_d _P_a_c_k_e_t _N_e_t_w_o_r_k _C_o_n_t_r_o_l_l_e_r


          Brian Kantor, WB6CYT
   _A_c_a_d_e_m_i_c _N_e_t_w_o_r_k _O_p_e_r_a_t_i_o_n_s _G_r_o_u_p
      _O_f_f_i_c_e _o_f _A_c_a_d_e_m_i_c _C_o_m_p_u_t_i_n_g
  _U_n_i_v_e_r_s_i_t_y _o_f _C_a_l_i_f_o_r_n_i_a, _S_a_n _D_i_e_g_o












































9


                           December 6, 1988





                                - 2 -


                 Abstract
_A _f_a_s_t _n_e_t_w_o_r_k _f_o_r _a_m_a_t_e_u_r _r_a_d_i_o  _r_e_q_u_i_r_e_s
_s_o_p_h_i_s_t_i_c_a_t_e_d  _n_o_d_e  _c_o_n_t_r_o_l_l_e_r_s  _t_o  _w_o_r_k
_w_e_l_l.  _K_e_y _t_o _t_h_e _p_e_r_f_o_r_m_a_n_c_e _o_f  _a_d_v_a_n_c_e_d
_n_o_d_e  _c_o_n_t_r_o_l_l_e_r _h_a_r_d_w_a_r_e _i_s _t_h_e _d_e_s_i_g_n _o_f
_t_h_e _o_n-_b_o_a_r_d _s_o_f_t_w_a_r_e.  _I_s_s_u_e_s _o_f  _h_i_g_h_l_y-
_e_f_f_i_c_i_e_n_t  _d_e_v_i_c_e _d_r_i_v_e_r_s, _p_r_o_t_o_c_o_l _e_n_c_a_p_-
_s_u_l_a_t_i_o_n, _a_n_d _p_r_o_c_e_s_s _m_a_n_a_g_e_m_e_n_t  _m_u_s_t  _b_e
_a_d_d_r_e_s_s_e_d _t_o _e_n_s_u_r_e _a_c_c_e_p_t_a_b_l_e _p_e_r_f_o_r_m_a_n_c_e
_w_i_t_h   _l_i_m_i_t_e_d   _m_e_m_o_r_y   _a_n_d   _a_f_f_o_r_d_a_b_l_e
_h_a_r_d_w_a_r_e.   _P_S-_1_8_6  _h_a_r_d_w_a_r_e _d_e_s_i_g_n _i_s_s_u_e_s
_a_r_e _d_i_s_c_u_s_s_e_d  _i_n  _a  _c_o_m_p_a_n_i_o_n  _p_a_p_e_r  _b_y
_M_i_c_h_a_e_l  _B_r_o_c_k,  _F_r_a_n_k_l_i_n _A_n_t_o_n_i_o, _a_n_d _T_o_m
_L_a_F_l_e_u_r.


_1.  _I_n_t_r_o_d_u_c_t_i_o_n

     A high-throughput data  network  must
consist  of both high speed links and fast
network node controllers.  To achieve  the
high throughput in the controller requires
both good hardware and efficient software.

     The PS-186 offers a highly  efficient
hardware  design including very high speed
input/output and  a  fast  processor.  The
PS-186's  high-speed  DMA  channels  allow
much of the I/O  to  proceed  in  parallel
with computation, thus overlapping I/O and
processing that in  a  less  sophisticated
system might need to proceed serially.

     However,  even  the   most   advanced
hardware  can  be  crippled by inefficient
software that wastes CPU and I/O resources
rather  than  applying them to useful pro-
cessing. To take  full  advantage  of  the
PS-186 architecture, we have chosen to use
a multi-tasking system  that  can  support
several   programs  running  at  once.  By
dividing up  tasks  into  those  that  are
time-critical  and  those that are not, we
can set up the critical tasks in the  sys-
tem   such  that  they  will  receive  the
required  CPU  attention.  Less   critical
tasks will proceed as time permits.

9_________________________
Author's current address:
_e_l_e_c_t_r_o_n_i_c: brian@sdcsvax.ucsd.edu
_p_a_p_e_r: Office of Academic Computing B-028, La Jolla, CA 92093 USA




                           December 6, 1988





                                - 3 -


     The  PS-186  multi-tasking  operating
system  is based in general terms upon the
UNIX8r9 and other similar  simple  operating
systems.  In particular, many of the ideas
and practices used have  been  taken  from
those presented in the MINIX [10] and XINU
[2,3] model operating systems.

_2.  _S_o_f_t_w_a_r_e _O_r_g_a_n_i_z_a_t_i_o_n

     The PS-186 operating software can  be
divided into two categories.  One of these
is the central or core part of the system,
referred to as the _k_e_r_n_e_l. It is responsi-
ble  for  all  supervisory  low-level  I/O
functions,  process and memory management,
interrupt  handling,  and  initialization,
and  time-critical  tasks.  The  remaining
software  is   termed   the   _u_s_e_r   level
software,  although  there  are,  strictly
speaking, no users on  this  system.   The
true  distinction  is that while there may
be many ``user'' processes running at  one
time, there is only one kernel.

     User processes can be  stopped  while
they  are  waiting for input, or while the
kernel is handling some other  event  such
as an arriving packet.  They are typically
used  for  purposes  that  are  not  time-
critical  and  that  can  operate indepen-
dently of particular hardware  status.   A
few examples of tasks that might better be
placed in user processes are  table  look-
ups,  help  menu  displays,  and the like.
These  are  things  that  can  proceed  in
parallel with other similar tasks.

     The kernel, on  the  other  hand,  is
strictly  _s_i_n_g_l_e-_t_h_r_e_a_d_e_d  -  it  can only
execute by itself,  and  is  intended  for
those  tasks  that  need to have exclusive
access to the processor or  devices,  such
as interrupt handlers and device drivers.

     User processes do NOT directly access
devices,  nor  do  they handle interrupts.
All user processes  communicate  with  the
kernel by means of _s_y_s_t_e_m _c_a_l_l_s that cause
the kernel to perform some task on  behalf
_________________________
UNIX is a registered trademark  of  AT&T  Bell  Labora-
tories
9


                           December 6, 1988





                                - 4 -


of the user process.  A common example  is
a  read or a write - data transfer between
a user process and a device.

     The kernel is  responsible  for  per-
forming  all encapsulating protocols below
some arbitrary level, which we have chosen
to  be at the ``data stream'' level.  That
means that when  an  AX.25  connection  is
made to the PS-186, the kernel software is
responsible  for  the   acceptance,   ack-
nowledgement,  and  eventual  knockdown of
the connection.  The kernel  will  extract
the  data  field  from  the incoming AX.25
packet and make it  available  to  a  user
process  executing  an  appropriate  _r_e_a_d.
Likewise, a user process  that  wishes  to
send  data  over  an open AX.25 connection
will give the data to the kernel by  means
of  a  _w_r_i_t_e  system  call, and the kernel
will do  the  encapsulation  necessary  to
send the data via AX.25, and then queue it
for transmittal by the appropriate device.

     When  there  are  multiple  protocols
involved,  such  as  TCP-IP  on AX.25, the
kernel does the multiple  extractions  and
encapsulations  as  required,  so that the
user process again works only at the  data
level.

     The decision on whether  to  place  a
particular  task in the kernel or leave it
to user-level processing  is  based  on  a
number  of  criteria, some of them empiri-
cal.  In general, any task which can  wait
to  complete without impacting the perfor-
mance of  other  tasks  can  generally  be
placed  in  a  user-level process, whereas
tasks that have a number of process depen-
dencies  pretty much have to go inside the
kernel.  Additionally, any protocol encap-
sulation  or unwrapping that does not gen-
erate additional packets can be placed  in
the  kernel,  thereby making that function
available to all user-level  processes  by
means of a uniform system call.

_3.  _I_n_p_u_t-_O_u_t_p_u_t

     Each  device  in  the  PS-186  has  a
module  of  code associated with it in the
kernel  that  does  the  low-level  input-
output  interface  to the actual hardware.


9                           December 6, 1988





                                - 5 -


This module is often referred  to  in  the
literature as a _d_e_v_i_c_e _d_r_i_v_e_r.

     At the low level hardware  interface,
a  device driver is responsible for taking
data to be output  from  some  generalized
system  data  structure, and actually out-
putting it through the corresponding piece
of hardware.  It also must accept incoming
data from the hardware device and place it
into  a  system data structure for further
processing.  It  is  common   for   device
drivers  to operate in an _i_n_t_e_r_r_u_p_t-_d_r_i_v_e_n
mode, with their actions being invoked  in
response  to  ``completion''  or ``ready''
signals from the hardware. We have  chosen
this  method over a perhaps simpler scheme
where  the  main  software   loop   simply
repetitively  checks for device availabil-
ity, because the latter scheme potentially
wastes  a tremendous portion of the avail-
able processing power.  Additionally,  the
various  drivers  make use of the PS-186's
DMA (Direct Memory Access)  capability  to
move  the  actual  data between memory and
the device without the need of the CPU  to
read and write every byte.

     There also must be a simple and  con-
sistent higher-level interface to the sys-
tem data  structures  that  the  low-level
device  drivers access.  We have chosen to
implement this interface as _r_e_a_d and _w_r_i_t_e
system  calls  that invoke high-level por-
tions of the device drivers. Additionally,
there  are both high- and low-level confi-
guration, status, and initialization func-
tions  that are logically part of the dev-
ice  driver.   Thus  each  driver  can  be
divided   into   two   logical  functions,
referred to as the ``top'' and  ``bottom''
of the driver.

     The ``bottom'' function is the inter-
face  to  the  hardware;  it is invoked in
response to an interrupt from  the  actual
device.  Typically its sole function is to
move data to and from the  device  and  an
associated memory buffer or buffer queue.

     The ``top'' function is the interface
to the kernel _r_e_a_d and _w_r_i_t_e system calls.
It does the opposite of its  corresponding
``bottom''  half;  where  the  bottom half


9                           December 6, 1988





                                - 6 -


places  incoming  hardware  data  into   a
buffer,  the top half will remove the data
from the buffer and give it to the process
executing the _r_e_a_d kernel call.

     When the PS-186 kernel has data to be
sent  out of a serial port (in response to
a _w_r_i_t_e system call), the device driver is
called  to  accept the outgoing data.  The
driver adds the data to the tail end of  a
queue  of  data waiting to be sent, sets a
flag indicating that there is indeed  data
to  be sent, and returns.  Later, when the
output device finishes with  the  data  it
was sending, it will cause an interrupt to
occur, and  the  ``bottom  half''  of  the
appropriate  device  driver  will take the
next chunk of data from the queue and send
it  to  the  device.  Thus the kernel (and
therefore user processes)  need  not  wait
for  I/O  on  a  device to complete before
resuming proceeding.

     Data buffers and queues  are  dynami-
cally  allocated; when data is received or
generated a ``buffer'' (a block of memory)
is  allocated  from  the pool of available
memory to hold it, and a ``pointer''  that
contains  the  memory address of the block
is set up.  To save the time that would be
wasted in copying from one block of memory
to another, data is passed from module  to
module  by  passing  the  pointer  to  the
memory buffer in which the  data  resides,
rather  than copying the data itself. When
the data is finally  consumed,  either  by
being output by a device, or copied into a
user-level (outside the kernel) buffer  by
a  _r_e_a_d system call, the memory space used
is returned to the available memory  pool,
and the buffer pointer is dereferenced.

     When a call to the kernel  with  data
to  be output (a _w_r_i_t_e call) would require
allocation of  more  memory  buffer  space
than  is  allowed,  the process making the
call is stopped by the simply not  return-
ing  from  the system call to that process
until there is space and the write can  be
completed.  Since ``blocking'' the process
in this manner  does  NOT  stop  interrupt
service  nor  other  kernel functions, the
device will eventually output enough  data
to free up sufficient memory for the write


9                           December 6, 1988





                                - 7 -


to complete and for the  user  process  to
resume.

     On input, if a chunk of data  arrives
and  there  is no memory available to hold
it, the only  practical  procedure  is  to
simply  discard  the  data.  We anticipate
having a large amount of memory  available
for  data  buffers,  as  well as expecting
good throughput, so we do  not  anticipate
that  it  will  necessary  to discard data
often.   As  a  practical  note,  we  have
decided  to provide each input device with
its own memory buffer limit so that no one
device  could hog all available memory and
shut out input from other devices even  in
the most pathalogical of cases.  The over-
riding  assumption  is  that  higher-level
protocols  will handle packets lost due to
memory congestion in  much  the  same  way
that  packets  lost  due  to collisions or
channel congestion are handled.

     A kernel _r_e_a_d call will  return  data
from  the input queue to the user process;
if there is no data in the queue, the user
process  may  elect to wait until there is
(``read-wait'') or  just  return  (``read-
no-wait'').  When data is available, it is
copied into a buffer space provided by the
user   process   (typically   a  character
array), and the  memory  buffer  space  is
released  to be reused on subsequent input
events.

     One can view the input-output streams
as  a series of filtered interfaces to the
raw packets that  are  being  received  or
sent.   Thus it is possible to open a con-
nection that consists of raw AX.25 frames,
an AX.25 connected mode stream, IP packets
in SLIP, IP packets in AX.25, TCP in IP in
AX.25,  etc.   This  is  controlled by the
parameters passed to  the  kernel  in  the
_o_p_e_n system call.

_4.  _D_e_v_i_c_e_s

     The  PS-186  devices  that  are  most
interesting  are  the  several serial con-
troller chips that form the communications
interfaces.   (There is an SCSI controller
option for general device access, such  as
to  a  disk  or  floppy controller, but we


9                           December 6, 1988





                                - 8 -


will not discuss that  here.)  The  serial
controller  chosen was the Zilog 8530 SCC;
the hardware  design  considerations  that
lead  to  it being chosen are discussed in
the companion paper on the hardware design
of the PS-186.

     The 8530 SCC can do both asynchronous
serial  I/O  (as  perhaps to a terminal or
printer), and HDLC synchronous, such as is
used  in  the  AX.25 protocol.  Any of the
PS-186's serial ports can be configured to
operate  in  either  of  these  modes.  We
therefore  have  a  more  complex   device
driver than if the PS-186 had fixed serial
port allocations, since the device  driver
must  be  able  to  handle  both sync- and
async-configured devices based on  parame-
ters  stored  in  a  table.  The driver is
also responsible for setting up the  modes
of the serial ports in the first place.

_5.  _P_r_o_t_o_c_o_l _H_a_n_d_l_i_n_g

     Fundamental to the operation of  com-
munications  protocols  is  the concept of
_l_a_y_e_r_i_n_g or _e_n_c_a_p_s_u_l_a_t_i_o_n, whereby data is
successively  encapsulated  or ``wrapped''
in layers of protocol as  it  is  prepared
for transmission, and ``unwrapped'' at its
destination.

     The basic concept used is known as  a
_s_w_i_t_c_h.  As a railroad switch controls the
path of a train, the switch  controls  the
path  that  data takes through the various
levels of encapsulation and unwrapping.  A
_p_r_o_t_o_c_o_l  _s_w_i_t_c_h  makes  the  data passing
decision based on  a  field  contained  in
each protocol's header that indicates what
kind of protocol may be  further  encapsu-
lated within the data field of the current
packet.

     The protocol handling scheme that  we
chose  to  use in the PS-186 is located in
the kernel software. By keeping all proto-
col  wrapping  and  unwrapping  inside the
main single-thread portion of the  operat-
ing  system and thus making them available
to all processes on the  system,  we  sim-
plify  greatly the amount of protocol han-
dling  required  in  the   various   other
processes.


9                           December 6, 1988





                                - 9 -


     Each PS-186 communications  interface
is  configured  at  system startup time to
handle one type of outermost protocol (for
example,  KISS  AX.25 is appropriate for a
serial  interface  to  a  radio  link,  or
perhaps  SLIP  for  a hardwire line.) As a
packet is received from an  interface,  it
is  examined  according  to the rules that
have been set up for that interface.  When
that  packet  has  been  received  and the
appropriate  acknowledgements   generated,
the  contents  of  the packet and selected
fields  extracted  from  its  header   are
passed to the appropriate protocol switch.
The  protocol  switch  then  examines  the
packet   contents   and  routes  the  data
further to the next  protocol  module,  as
appropriate.   This  process repeats until
there is  no  further  enclosed  protocol,
until  the  data  has been fully extracted
and is available in a buffer queue  to  be
used  by some user process.  Not until the
data has  been  fully  extracted  does  it
become  ready to leave the kernel environ-
ment.

     A  concrete  example  may  make  this
clearer:  Suppose that we receive an AX.25
packet on a serial link that  is  attached
to  a  radio. If it is for us (as shown by
the destination  callsign)  we  will  ack-
nowledge  that  AX.25 packet (if appropri-
ate), and if it was a data packet (UI or I
frame),  we will pass it to the AX.25 pro-
tocol switch.  That  switch  will  examine
the  Protocol  ID byte that is part of the
AX.25 packet.  If the PID is for a  stream
connection (a normal mode AX.25 connection
such as  is  commonly  in  use  today  for
keyboard-to-keyboard   typing),  then  the
packet will be  further  switched  by  the
AX.25  Stream  switch,  which will send it
(based on the callsign in the source field
of  the  AX.25  header, since there can be
only one connection per  source  callsign)
to the user process that is servicing that
stream connection.

     If, instead,  the  PID  is  for  ARP,
RARP,  RIP, or one of the other raw packet
protocols, the data will be  sent  to  the
user  process  that  handles  that kind of
packet  -  to  build  address  or  routing
tables, for example.


9                           December 6, 1988





                                - 10 -


     A packet with the PID  indicating  an
encapsulated  IP  packet  is passed to the
module that does IP protocol  -  checksums
and other integrity checks.  If the packet
is ok by IP standards, the IP module  will
call  the  IP  protocol switch, which will
examine the Protocol ID  byte  in  the  IP
packet (distinct from the PID in the AX.25
packet).  This will in turn route  the  IP
packet  to  another protocol handler, such
as UDP, ICMP, RDP, or TCP -  whichever  we
have   implemented.    Again,   those  are
expected to route the data based on fields
in the headers of these protocols.

     TCP  is  an  interesting  example  of
imbedded protocols and switching. Each TCP
connection as seen on a host is designated
uniquely  by  a  64-bit number that is the
concatenation of the distant host's Inter-
net  address  (32  bits) and the local and
distant TCP logical port numbers (16  bits
each).   Since  when  a  TCP connection is
initiated, the originating host must chose
a  new  (not  currently nor recently used)
logical source port number, there  can  be
multiple  logical connections between TCPs
on the same two hosts  even  to  the  same
distant  port.   The  data  switch  in the
receiving TCP is required to separate  out
the  streams of data based upon the 64-bit
stream identifier,  and  deliver  each  to
potentially  separate  user  processes  as
appropriate.

_6.  _P_r_o_c_e_s_s _C_o_n_t_r_o_l

     The PS-186 is organized as  a  multi-
tasking  system,  implying  that more than
one process may  be  running  at  a  time.
There is always a ``null'' process that is
constantly ready to run; when there is  no
other  process ready to run, the null pro-
cess is active.

     Processes are created as  needed  and
destroyed  when no longer needed.  In this
manner,  resources  are  not  consumed  on
idling  processes  that are merely sitting
around waiting in case  they  are  needed.
For  example,  when an AX.25 connection is
made to the PS-186 network node, a process
is started to handle incoming stream data.
This process will exit and  its  resources


9                           December 6, 1988





                                - 11 -


will be deallocated when the connection is
closed.  Each such connection will cause a
separate process to be spawned.

     User-level processes do  not  perform
I/O  operations  to  devices; they instead
make  _s_y_s_t_e_m  _c_a_l_l_s  to  the  kernel  that
invoke  the  required I/O.  When a process
makes a system  call  that  would  require
some  time to complete (such as I/O), that
process is _b_l_o_c_k_e_d - that  is,  placed  in
suspended  state,  and  another process is
resumed.   Periodically  (in  response  to
interrupts   from   the  system  real-time
clock),  the  current  process   will   be
suspended  and  another  selected  to run.
Thus no process can hog CPU resources, and
I/O  can proceed in parallel with ordinary
processing.

     We feel that multitasking is a  supe-
rior  method in this application, although
it is much more  complex  than  a  single-
threaded  program,  because  much  of what
goes on in a device like the PS-186 is not
time-critical, and we can therefore devote
the CPU to high-priority events  (such  as
the  arrival  and  buffering  of a packet)
that are truly critical.

_7.  _C_o_n_c_l_u_s_i_o_n

     We  feel  that  the  PS-186  Advanced
Packet Controller represents a significant
step towards the construction of an  effi-
cient  and  practical  amateur  radio data
network.  By combining fast  hardware  and
efficient software into a flexible package
that  accomodate  today's  and  tomorrow's
protocols, we believe we have advanced the
network one step further along the road to
completion.

_8.  _R_e_f_e_r_e_n_c_e_s

[1]  AT&T,    ``Communications    Protocol
     Specification   BX.25'',  Publication
     54001 Issue 2 (June 1980)

[2]  Comer, D., _O_p_e_r_a_t_i_n_g _S_y_s_t_e_m _D_e_s_i_g_n  -
     _t_h_e   _X_I_N_U   _A_p_p_r_o_a_c_h,  Prentice-Hall
     (1984)

9


                           December 6, 1988





                                - 12 -


[3]  Comer, D., _O_p_e_r_a_t_i_n_g _S_y_s_t_e_m _D_e_s_i_g_n  -
     _I_n_t_e_r_n_e_t_w_o_r_k_i_n_g  _w_i_t_h _X_I_N_U, Prentice-
     Hall (1987)

[4]  DEC/Intel/Xerox, ``The Ethernet  -  A
     Local  Area  Network  Data Link Layer
     and Physical Layer  Specification.'',
     Version 1.0 (Sep 30 1980)

[5]  Fox, T. L., ``AX.25  Amateur  Packet-
     Radio  Link-Layer Protocol'', Version
     2.0, ARRL (Oct 1984)

[6]  Griffiths, Georgia,  and  G.  Carlyle
     Stones,  ``The  Tea-Leaf Reader Algo-
     rithm: An Efficient Implementation of
     CRC-16  and  CRC-32'', Communications
     of the ACM, 30,7 (July 1987)

[7]  IEEE, _L_o_g_i_c_a_l _L_i_n_k _C_o_n_t_r_o_l, ANSI/IEEE
     Std 802.2-1985 (1984)

[8]  Postel, J.  et  al.,  ``DDN  Protocol
     Handbook'', USC-ISI (1986)

[9]  Tanenbaum,  A.,  _C_o_m_p_u_t_e_r   _N_e_t_w_o_r_k_s,
     Prentice-Hall (1981)

[10] Tanenbaum, A.,  _O_p_e_r_a_t_i_n_g  _S_y_s_t_e_m_s  -
     _D_e_s_i_g_n  _a_n_d _I_m_p_l_e_m_e_n_t_a_t_i_o_n, Prentice-
     Hall (1987)






















9


                           December 6, 1988


